Transforming RuThes Thesaurus to Generate Russian WordNet
Abstract
In this paper we describe the semi-automatic process of transforming the Russian-language thesaurus RuThes into a WordNet-like thesaurus called RuWordNet. In this procedure we aimed to reproduce two characteristic features of WordNet-like resources: the division of data into part-of-speech-oriented structures with cross-references between them, and a set of relations similar to those of WordNet-like resources.
Similar resources
RuThes Linguistic Ontology vs. Russian Wordnets
The paper describes the structure and current state of RuThes, a thesaurus of the Russian language constructed as a linguistic ontology. We compare the RuThes structure with the WordNet structure and describe principles for the inclusion of multiword expressions, types of relations, and experiments and applications based on RuThes. For a long time RuThes has been developed within various NLP and information-retrie...
Comparing Two Thesaurus Representations for Russian
In the paper we present a new Russian wordnet, RuWordNet, which was obtained semi-automatically by transforming the existing Russian thesaurus RuThes. At the first stage, the basic structure of wordnets was reproduced: a synset hierarchy for each part of speech and the basic set of relations between synsets (hyponym-hypernym, part-whole, antonyms). At the second stage, we added causation, e...
Determining the Most Frequent Senses Using Russian Linguistic Ontology RuThes
The paper describes a supervised approach for detecting the most frequent senses of words on the basis of the RuThes thesaurus, a large linguistic ontology for Russian. Owing to the large number of monosemous multiword expressions and the set of RuThes relations, it is possible to calculate several context features for ambiguous words and to study their contribution to a supervised mod...
Уточнение русскоязычных словарей эмоциональной лексики с использованием тезауруса RuThes (Refinement of Russian Sentiment Lexicons Using RuThes Thesaurus)
The paper describes a combined approach to the extraction of a domain-specific sentiment lexicon. First, an initial version of a domain-specific lexicon is obtained by applying a supervised model. At the second stage, the ordered list of sentiment words is refined using the thesaurus information. This combined model is applied to several domains, and finally the domain-specific sentiment lex...
Creating a General Russian Sentiment Lexicon
The paper describes the new Russian sentiment lexicon RuSentiLex. The lexicon was gathered from several sources: opinionated words from domain-oriented Russian sentiment vocabularies, slang and curse words extracted from Twitter, objective words with positive or negative connotations from a news collection. The words in the lexicon having different sentiment orientations in specific senses are ...
Publication date: 2017